Speaker-and-Environment Change Detection in Broadcast News Using Maximum Divergence Common Component GMM

نویسنده

Yih-Ru Wang

چکیده

In this paper, the supervised maximum-divergence common component GMM (MD-CCGMM) model was used to the speaker-andenvironment change detection in broadcast news signal. In order to discriminate the speaker-and-environment change in broadcast news, the MD-CCGMM signal model will maximize the likelihood of CCGMM signal modeling and the divergence measure of different audio signal segments simultaneously. Performance of the MD-CCGMM model was examined using a four-hour TV broadcast news database. A result of 16.0% Equal Error Rate (EER) was achieved by using the divergence measure of CCGMM model. When using supervised MD-CCGMM model, 14.6% Equal Error Rate can be achieved.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Speaker-and-environment change detection in broadcast news using the common component GMM-based divergence measure

In this paper, a GMM with common mixture components, referred to as the common component GMM (CCGMM), is proposed to be the signal model for calculating the diversity measure for the speaker-and-environment change detection in broadcast news signal. The use of GMM is to increase the accuracy of audio signal modeling while the use of common mixture components is to solve the complexity problem o...

متن کامل

On-line incremental speaker adaptation with automatic speaker change detection

In order to improve the performance of speech recognition systems when speakers change frequently and each of them utters a series of several sentences, a new unsupervised, online and incremental speaker adaptation technique combined with automatic detection of speaker changes is proposed. The speaker change is detected by comparing likelihoods using speaker-independent and speaker-adaptive GMM...

متن کامل

Universal Background Models for Real-time Speaker Change Detection

This paper addresses the problem of real-time speaker change detection in TV news broadcast, in which no prior knowledge on speakers is assumed. To remove the unreliable frames and background frames in the speech stream, we propose a new approach for feature categorization based on Gaussian Mixture Model Universal Background Model (GMM-UBM). The feature vectors are categorized into three sets, ...

متن کامل

Error Detection in Broadcast News ASR Using Markov Chains

This article addresses error detection in broadcast news automatic transcription, as a post-processing stage. Based on the observation that many errors appear in bursts, we investigated the use of Markov Chains (MC) for their temporal modelling capabilities. Experiments were conducted on a large Amercian English broadcast news corpus from NIST. Common features in error detection were used, all ...

متن کامل

Two step speaker segmentation method using Bayesian information criterion and adapted Gaussian mixtures models

This paper addresses the topic of online unsupervised speaker segmentation in a complex audio environment as it is present in the Broadcast News databases. A new two stage speaker change detection algorithm is proposed, which combines the Bayesian Information Criterion with an ABLS-SCD statistical framework where adapted Gaussian mixture models are used to achieve higher accuracy. To enhance th...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره شماره

صفحات -

تاریخ انتشار 2006

Speaker-and-Environment Change Detection in Broadcast News Using Maximum Divergence Common Component GMM

نویسنده

چکیده

منابع مشابه

Speaker-and-environment change detection in broadcast news using the common component GMM-based divergence measure

On-line incremental speaker adaptation with automatic speaker change detection

Universal Background Models for Real-time Speaker Change Detection

Error Detection in Broadcast News ASR Using Markov Chains

Two step speaker segmentation method using Bayesian information criterion and adapted Gaussian mixtures models

عنوان ژورنال:

اشتراک گذاری